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Abstract: Collisions of composite particles impose an arbitrary boost in the longitudinal 
direction on a given event. This implies that the centre-of-mass frame at hadron colliders 
is undetermined for processes with missing energy in the final state. This motivates the 
modification of the Fox-Wolfram moments such that the moments for a given event are 
identical when viewed in the lab or centre-of-mass frame of the beam. The resulting mo¬ 
ments are invariant under rotations in the plane transverse to the beam and boosts parallel 
to the beam. These moments are then used to demonstrate improved signal separation in 
the channel where the Higgs decays to two b-quarks while being produced in association 
with a vector boson. 
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1 Introduction 

The complexity of final states being studied at hadron colliders motivates the use of topo¬ 
logical variables. The topological variables being used at the Large Hadron Collider (LHC) 
were often developed in the context of electron-positron colliders. This is true of the Fox- 
Wolfram moments (FWMs). The FWMs have been applied in the context of the ATLAS 
and CMS experiments m but are not widely used. In contrast, the B-factories, such as 
Belle and BaBar, use the FWMs extensively to partition phase space In particular, 

the FWMs are used in the algorithms to suppress the continuum background [6]. The 
FWMs partition phase space in a way which is natural for B-factories but not always nat¬ 
ural for hadron colliders. The FWMs correspond to a decomposition of the event’s phase 
space into Fourier modes on the surface of a sphere. In the context of hadron colliders, it 
would be desirable for a harmonic analysis to be invariant under Lorentz boosts parallel to 
the direction of the beam. Incorporating and evaluating this change will be the subject of 
this submission. 

2 Overview of Fox-Wolfram Moments 

The FWMs are defined as [7H9]: 
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( 2 . 1 ) 


m=—l i=l 
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N 

where are the of spherical harmonics, ^ is the sum over all reconstructed objects or 

i=l 

particles, y/s is the centre-of-mass energy of the collision and 6i, (pi and pi are the i''^ object’s 
spherical coordinates and momentum in the event’s centre-of-mass frame. The moments 
can be written in terms of the angular distance between each hnal state object: 

N 

^1= \Pi\ \Pj\Pi {cosVtij) , (2.2) 

using the addition formula for the spherical harmonics: 

, m=l 

Piicosn^j) = ^ (2.3) 

m=—l 

where Pi is the Legendre polynomial of order I and Qij is the angular distance between 
particle i and j. In this form, the invariance under rotation is made manifest as the 
dependence on the arbitrary axis in equation |2.1| disappears. The moments are typically 
normalised to the zeroth moment; this corresponds to a uniform rescaling for electron- 
positron colliders but is more significant in hadron colliders. 

Intuitively, the Fox-Wolfram moments describe how compatible the event topology is 
with each of the spherical harmonics. The utility of the FWMs arises because the moments 
form an orthogonal basis and are invariant under the set of rotations, 50(3). The rotational 
symmetry reflects the symmetry of the particle collision in the event’s centre-of-mass frame. 
In effect, this means that any rotation of a given event will not change the moment (up to 
and excluding detector effects). Similarly, orthogonality removes the redundancy between 
different moments. When applied correctly, this allows the most important features of an 
event to be reduced to the lower order moments, with higher order moments describing 
features of the event dependent on finer resolutions. The FWMs do not contain enough 
information to reconstruct the energy-density because all information about phases has 
been removed. They do, however, allow the reconstruction of the energy-density correlation 
function. 

3 Limitations of the Fox-Wolfram Moments 

In hadron collider experiments, the Fox-Wolfram moments no longer describe the symmetry 
of the detector and are no longer orthogonal. The limitation of hadron colliders is that the 
centre-of-mass frame of the event cannot be accurately reconstructed for many processes; 
particularly processes with hnal states containing missing energy or where objects are mis- 
measured jets. In e'’'e“ colliders, the centre-of-mass energy of the collision is known and 
the event can always be boosted from the lab frame into the centre-of-mass frame. By 
contrast, hadron colliders can only determine the transverse missing energy, so for hnal 
states with large missing energy, the event’s four-vector sum cannot be reconstructed. The 
use of FWMs in this context implicitly assumes that the event is produced at rest in the lab- 
frame. This motivates the creation of a new set of moments that, unlike the Fox-Wolfram 
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moments, are invariant under longitudinal boosts rather than rotations in the polar angle, 
9. Both moments are invariant under rotations in the transverse plane. For brevity, the 
new moments will be referred to as the Hadron Collider Moments (HCMs). 

A further weakness of the FWMs is that the orthogonality condition for the spherical 
harmonics. 


ymym ^ A'k5iv5^ 


(3.1) 


no longer holds because of the incomplete coverage of the detectors. For example, the 
ATLAS detector’s inner tracker has a coverage in rapidity of |r/| < 2.5, corresponding to 
9 K, 40° missing out of 360°. Over this reduced integral, equation 3.1 becomes: 


ymym ^ Att5iv5„ 


(3.2) 


and hence the spherical harmonics no longer form an orthogonal basis. This limitation 
also applies to experiments where the centre-of-mass frame can be reconstructed. This 
problem is identical to that faced by the Cosmic Microwave Background (CMB) mapping 
experiments where the all-sky spectrum is expressed as a Fourier-Legendre series. In CMB 
experiments, the galactic plane must be excluded (again approximately 40° of 360°). To 
remedy this, a simple orthogonalisation procedure can be adopted identical to that used by 
the CMB experiments |10) . This limitation is not present in the HCMs. 


4 Symmetries of the Lab Frame in Hadron Colliders 


The natural symmetries of an event in the centre-of-mass frame is 50(3), or the two rota¬ 
tions of By contrast, the natural symmetries of an event in the lab frame of a hadron 
collider experiment is 50(2)'^ x 50(1,1)^, where 50(2)^ refers to rotations in the plane 
transverse to the beam and 50(1, l),g refers to the Lorentz boosts parallel to the beam. 
This motivates the use of the standard coordinate system of hadron colliders: 


p = mr cosh (y), =pt cos 4>, py =pT sin (p, = mr sinh (y), (4-1) 


where the rapidity, y, is defined as: 


y = 



E + p 
E — p 


The rapidity if often approximated by the pseudorapidity: 


rj = — log 



(4.2) 


(4.3) 


which is valid in the limit that the mass of the particles are negligible relative to their 
energy. At the LHC this is valid for leptons but may not be valid for jets which often 
have much larger masses. Similarly, the transverse mass, m-T, is often approximated to the 
transverse momentum, pT, in this limit. This coordinate system has the advantage that 
the symmetries of the system are manifest with the invariance of both Ay and A(/) under 
longitudinal boosts and rotations about the transverse plane. 
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It is also useful to define an invariant distance in this coordinate system: 

AR = VAr/2 + A(/.2 = rI + rI- 2 R 1 R 2 cos(A7), (4.4) 

where A 7 is the angle between the two radii in rj-cj) space. This contrasts with |2.2| where 
the distance on a great circle, fl, is given by: 

cos riij = cos{4>i) cos{4>j) + sin(i;i)j) sin(i;i)j) cos(0j — 9j). (4-5) 


5 Proposed Moments 


The FWMs are equivalent to a Fourier series on the surface of a sphere, also known as a 
Legendre series. Equation |4.4| describes a cone with radius AR. In this case, it is necessary 
to take a Fourier series on the radial component of a cylinder, also known as a Fourier-Bessel 
series. Together with 7 , the polar angle of R in (y, (j)) space, this forms a set of cylindrical 
coordinates with the metric: 


ds^ = rri^dy^ + p^dcj)^ ^ [dR? + R?d'y^) 


(5.1) 


The new set of moments can be calculated for the new symmetry by solving the Lapla- 
cian for the metric in equation 5.1 Alternatively, the Fourier expansion can be rewritten 
in terms of 7 and R and the moments inferred. This can be seen from the partial wave 
expansion: 


^-ik-r _ —ikrsin{9) _ 


g .... ^ ^ (fcr) (5.2) 


m=—00 

where Jm is the Bessel function. The partial wave expansion can be used to rewrite 
the Fourier series as: 


00 00 


=Y, E (5.3) 

n=l n=lm=—oo 

As with the FWMs, the HCMs describes the Fourier modes of the density correlation 


function and not the density. Using the equation 5.3 a set of modes can be constructed 


which are analogous to equation 2.1 




N 




2m7 


2 = 1 


(5.4) 


The infinite sum in this equation limits the usefulness of this form. The infinite sum can 
be removed by applying the addition theorem for Bessel functions, 

00 

Jo(Ai2i,2)= J] (5.5) 

m=—oo 

where ARf 2 = Ri+ R 2 — 2 R 1 R 2 cos 7 * 7 . The resulting equation has the dependence on an 
arbitrary axis removed and is analogous to equation | 2 . 2 | 

Sn = YPT,iPT,jJo {kiARij). 


(5.6) 








The coefficients of this expansion, ki, are defined by the boundary conditions. Using the 
Dirichlet boundary condition that the energy-density correlation function becomes zero 
outside of the detector acceptance region, the coefficients, ki, corresponds to the zero of 
Jq. For convenience, uq is set to zero. The moments are normalised to the zeroth HCM in 
the same manner as the FWMs. The dependence on the arbitrary axes disappears and the 
symmetries become manifest. Using these new moments, an event can be expected to give 
the same value whether they are measured in the lab or centre-of-mass frame. 

Information about the particle identihcations can be incorporated into both the FWMs 
and the HCMs by splitting up the summand as follows: 

E = E + E (5,7) 

i,j=all ij=iets 2,ji'=leptons 2=jets,j'=leptons 

This allows the moments to be divided into several moments consisting of each of these 
summands: 

~ ‘S'ijleptonxlepton T 'S'zjetsxjets T 2 X Szjgptonsxjets- (5.8) 

Equation |2.2| is a Fourier-Bessel series which can also be interpreted as the Hankel 
transform on a discrete interval. The Hankel transform is linked to the Abel transform by 
the Projection-Slice theorem |12| . In this theorem, the Hankel transform is equivalent to 
the Abel transform view in Fourier space: 

T-A = n. (5.9) 

The Abel transform of a function, /(r), can be written in the following form: 

OO 

-4= y / (\/x2 dy. (5.10) 

— OO 

This allows the moments to also be interpreted as examining, in Fourier space, the projection 
of the density correlation function in (r/, (p) space along parallel lines of constant (p. 

6 Application to Associated Production of H bb 

As a simple test of the utility of the HCMs relative to the FWMs, the moments are applied 
to the all-hadronic hnal state of the H —)• bb produced by Higgs-strahlung from a W- or 
Z-boson. This process was chosen because of the complexity of its final state, and it is a 
new channel which offers the potential to increase the explored final states of the process 
H —bb at the LHC. One of the main backgrounds for this signal is the production of tt and 
tt + jets. The moments are tested for their discriminating power against this background. 

6.1 Simulation and Monte-Carlo Generation 

The simulated events used to test the new moments are generated using the MadGraph jl3) 
and PythiaS [2] simulation framework with the ATLAS detector simulated by Delphes 
|15j with proton-proton collisions at ^/s = 13 TeV. A filter is placed on the jets which 
corresponds to a 17 GeV cut on jet pT at the parton level and a 20 GeV cut was used at 
reconstruction level. The background sample includes up to three additional jets. The hnal 
state requires at least two jets which are tagged as coming from the decay of a b-quark. 
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Feature importances of HCM and FWM 
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Figure 1: The distribution of feature improtance from the random forest when testing 
the separation of the all-hadronic hnal state for the signal H —)■ bb, produced in association 
with a IT- or Z-boson, from the backgrounds consisting of top quark pair production with 
additional jets. The distribution of feature importances is shown for the FWMs (blue) 
and the HCMs (green) with the moments arranged in decreasing importance. The feature 
importance in each tree is dehned as the normalised reduction in node impurity brought by 
all splits on that feature in the tree. The overall features importance is the average feature 
importance across all trees. For brevity, only the hrst eight moments were calculated for 
both the FWMs and HCMs. The corresponding index for each moment is shown on the 
x-axis. 


6.2 Quantifying Separation 

The separation of these two processes is quantihed by the integrated area under the receiver 
operating characteristic (ROC) curves. In addition to the ROC curves, a simple random 
forest m is constructed to obtain the separation which includes the effect of correlations. 
A random forest was chosen for this task because it is able to more fully explore the full 
phase space provided by these variables in a less biased way. At each decision node in the 
decision tree, the best cut from a random subset of the input variables is performed; by 
contrast a classic decision tree is usually deterministic. Even when boosted, a decision tree 
will tend to look fairly uniform and it will consistently cut on a few dominant moments in 
the first few decision nodes. This prevents the full phase space of the moments being fully 
explored. 
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The feature importance is generated by the random forests and consists of the nor¬ 
malised reduction in node purity brought by all splits on that feature averaged across all 
trees in the random forest (with the sum of all scores normalised to one) |17| . The distribu¬ 
tion of feature importances for the hrst eight moments of the FWMs and HCMs are shown 
in hgure[^ 

6.3 Analysis 

Figure shows greater separation between signal and background when using the HCMs 
over the FWMs for the hrst eight moments. The moment which demonstrates the most 
separation for the HCMs are shown in Figureand the feature importance and ROC curves 
for the HCMs and FWMs are shown in and respectively. The HCMs, in particular the 
eighth moment, show consistently better performance and separation than the FWMs. This 
is rehected by both the random forest’s ranking of feature importance and the ROC curves. 

The use of the FWMs and the HCMs are not mutually exclusive. Both provide informa¬ 
tion based on two different limits. The FWMs approximate the hnal state as being produced 
at rest while the HCMs assume that the boost is arbitrary. For the very heavy hnal mass 
states, the FWMs are expected to improve because heavier mass states are generally very 
close to rest in the rest frame of the two colliding protons. 

7 Conclusion 

The inability to determine the centre-of-mass frame of the collision in events with missing 
energy causes the centre-of-mass to only be determinable in the transverse plane. This 
causes the standard rotational symmetries implicit in the Fox-Wolfram moments to no 
longer hold. Hence, the same underlying event topology can lead to different Fox-Wolfram 
moments due to the varying longitudinal boost. To remedy this, a new set of topological 
moments are proposed for hadron colliders which rehect the underlying symmetry of the lab 
frame. The use of HCMs and FWMs is complementary with both moments representing 
limiting cases. The HCMs can be incorporated into analyses in the same way as the FWMs 
are; individual moments can be used as cuts or inputs for a multi-variable analysis. 
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Figure 2: The ROC curves and their respective areas for the FWMs (top) and HCMs 
(bottom) used in testing the separation of the all-hadronic final state for the signal H —?■ bb, 
produced in association with aW-or Z-boson, from the backgrounds consisting of top quark 
pair production with additional jets. 
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HCM 8 


Figure 3: The distribution for the eighth order HCM {I = 8) for the all-hadronic final 
state for the signal H —)• bb (blue), produced in association with a W- or Z-boson, from 
the backgrounds consisting of top quark pair production with additional jets (red). This 
moment was shown to have the most separation of the HCMs (for I < 8). 
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